Overview

Dataset Statistics

Number of Variables 10
Number of Rows 18560
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.4 MB
Average Row Size in Memory 248.5 B
Variable Types
  • Numerical: 5
  • Categorical: 4
  • DateTime: 1
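The two memory figures above can be cross-checked with a short calculation. This is only a sketch: it assumes the total is the row count times the (rounded) average row size, and that "MB" in the report means binary megabytes (MiB), which is not stated.

```python
# Figures copied from the Dataset Statistics table above.
n_rows = 18560
avg_row_bytes = 248.5  # rounded to one decimal place in the report

total_bytes = n_rows * avg_row_bytes
total_mib = total_bytes / 2**20   # binary megabytes (MiB)
total_mb = total_bytes / 10**6    # decimal megabytes (MB)

print(f"{total_bytes:.0f} B = {total_mib:.2f} MiB = {total_mb:.2f} MB")
```

Under these assumptions the reported 4.4 MB matches the binary interpretation rather than the decimal one.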

Dataset Insights

id is uniformly distributed
extra_features_count is skewed

Variables


id

numerical

Approximate Distinct Count 18560
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 296960
Mean 10025.8738
Minimum 0
Maximum 20097
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • id is uniformly distributed
  • id is skewed right (γ1 = 0.0055)

Quantile Statistics

Minimum 0
5-th Percentile 1010.95
Q1 5017.75
Median 10021.5
Q3 15013.25
95-th Percentile 19089.05
Maximum 20097
Range 20097
IQR 9995.5
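Range and IQR follow directly from the quantiles listed above. A minimal sketch, where the helper name `spread` is made up for illustration:

```python
def spread(minimum, q1, q3, maximum):
    """Return (range, IQR) computed from four quantile values."""
    return maximum - minimum, q3 - q1

# Quantiles of id as reported above.
id_range, id_iqr = spread(0, 5017.75, 15013.25, 20097)
print(id_range, id_iqr)
```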

Descriptive Statistics

Mean 10025.8738
Standard Deviation 5797.8281
Variance 3.3615e+07
Sum 1.8608e+08
Skewness 0.005533
Kurtosis -1.1949
Coefficient of Variation 0.5783
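One way to see why id is flagged as uniformly distributed is to compare the reported moments with those of a continuous uniform distribution on [Minimum, Maximum]. This is a rough sketch, not the report's own test: id takes 18560 distinct integer values between 0 and 20097, so the continuous formulas are only an approximation.

```python
import math

lo, hi = 0, 20097  # reported Minimum and Maximum of id

# Theoretical moments of a continuous uniform distribution on [lo, hi].
uniform_mean = (lo + hi) / 2
uniform_std = (hi - lo) / math.sqrt(12)
uniform_excess_kurtosis = -6 / 5

# Values reported in the tables above, for comparison.
reported = {"mean": 10025.8738, "std": 5797.8281, "kurtosis": -1.1949}

print(f"mean:     {uniform_mean:.1f} vs {reported['mean']}")
print(f"std:      {uniform_std:.1f} vs {reported['std']}")
print(f"kurtosis: {uniform_excess_kurtosis} vs {reported['kurtosis']}")
```

The reported standard deviation and kurtosis sit close to the theoretical values, and the skewness of 0.005533 is close to the uniform distribution's skewness of 0.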

make

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1323400
  • The most frequent category (Nissan) occurs over 38.49 times as often as the second most frequent category (Nissan Motor Egypt)

Length

Mean 6.3039
Standard Deviation 1.8853
Median 6
Minimum 6
Maximum 18

Sample

1st row Nissan
2nd row Nissan
3rd row Nissan
4th row Nissan
5th row Nissan

Letter

Count 116060
Lowercase Letter 96560
Space Separator 940
Uppercase Letter 19500
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Nissan, Nissan Motor Egypt) account for over 50.0% of rows
  • The most frequent word (nissan) occurs over 39.49 times as often as the second most frequent word (egypt)
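The two ratios reported for make (38.49 and 39.49) differ because one compares whole category values and the other compares individual words. The counts behind them are not printed in the report, but they can be reconstructed from the character statistics. The sketch below assumes that "Nissan Motor Egypt" is the only value containing spaces, two per occurrence.

```python
n_rows = 18560        # Number of Rows
space_chars = 940     # Space Separator count for make
upper_chars = 19500   # Uppercase Letter count for make

# Assumption: each "Nissan Motor Egypt" row contributes exactly two spaces.
egypt_rows = space_chars // 2
nissan_rows = n_rows - egypt_rows

# Cross-check: "Nissan" has 1 uppercase letter, "Nissan Motor Egypt" has 3.
assert nissan_rows * 1 + egypt_rows * 3 == upper_chars

category_ratio = nissan_rows / egypt_rows  # value "Nissan" vs "Nissan Motor Egypt"
word_ratio = n_rows / egypt_rows           # word "nissan" vs word "egypt"
print(round(category_ratio, 2), round(word_ratio, 2))  # 38.49 39.49
```

The word "nissan" appears in every row, which is why the word-level ratio is exactly one higher than the category-level ratio.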

model

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1305841
  • The most frequent category (Sunny) occurs over 4.55 times as often as the second most frequent category (Qashqai)

Length

Mean 5.3578
Standard Deviation 0.7908
Median 5
Minimum 4
Maximum 14

Sample

1st row Juke
2nd row Juke
3rd row Juke
4th row Juke
5th row Juke

Letter

Count 99439
Lowercase Letter 80877
Space Separator 2
Uppercase Letter 18562
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Sunny, Qashqai) account for over 50.0% of rows
  • The most frequent word (sunny) occurs over 4.55 times as often as the second most frequent word (qashqai)

model_year

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 296960
Mean 2016.3728
Minimum 1999
Maximum 2023
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • model_year is skewed left (γ1 = -1.0459)

Quantile Statistics

Minimum 1999
5-th Percentile 2008
Q1 2014
Median 2017
Q3 2020
95-th Percentile 2022
Maximum 2023
Range 24
IQR 6

Descriptive Statistics

Mean 2016.3728
Standard Deviation 4.3304
Variance 18.752
Sum 3.7424e+07
Skewness -1.0459
Kurtosis 1.6458
Coefficient of Variation 0.002148
  • model_year is not normally distributed (p-value 0.0008234630523542077)
  • model_year has 311 outliers

kilometers

numerical

Approximate Distinct Count 578
Approximate Unique (%) 3.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 296960
Mean 94310.145
Minimum 0
Maximum 285000
Zeros 236
Zeros (%) 1.3%
Negatives 0
Negatives (%) 0.0%
  • kilometers is skewed right (γ1 = 0.2984)

Quantile Statistics

Minimum 0
5-th Percentile 9999
Q1 41675
Median 90000
Q3 139999
95-th Percentile 200000
Maximum 285000
Range 285000
IQR 98324

Descriptive Statistics

Mean 94310.145
Standard Deviation 59968.2582
Variance 3.5962e+09
Sum 1.7504e+09
Skewness 0.2984
Kurtosis -0.7418
Coefficient of Variation 0.6359
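Variance and coefficient of variation are both derived from the mean and standard deviation, so the table above can be checked for internal consistency. A small sketch using the kilometers figures:

```python
# Values reported for kilometers above.
mean = 94310.145
std = 59968.2582

variance = std ** 2   # variance is the square of the standard deviation
cv = std / mean       # coefficient of variation is std relative to the mean

print(f"variance = {variance:.4e}, cv = {cv:.4f}")
```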

transmission_type

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1370266
  • The most frequent category (Automatic) occurs over 16.54 times as often as the second most frequent category (Manual)

Length

Mean 8.829
Standard Deviation 0.6956
Median 9
Minimum 6
Maximum 9

Sample

1st row Automatic
2nd row Automatic
3rd row Automatic
4th row Automatic
5th row Automatic

Letter

Count 163866
Lowercase Letter 145306
Space Separator 0
Uppercase Letter 18560
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Automatic, Manual) account for over 50.0% of rows
  • The most frequent word (automatic) occurs over 16.54 times as often as the second most frequent word (manual)

price

numerical

Approximate Distinct Count 797
Approximate Unique (%) 4.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 296960
Mean 274881.1961
Minimum 1000
Maximum 1384000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • price is skewed right (γ1 = 1.6041)

Quantile Statistics

Minimum 1000
5-th Percentile 129000
Q1 181000
Median 248000
Q3 338000
95-th Percentile 513000
Maximum 1384000
Range 1383000
IQR 157000

Descriptive Statistics

Mean 274881.1961
Standard Deviation 128976.204
Variance 1.6635e+10
Sum 5.1018e+09
Skewness 1.6041
Kurtosis 4.5143
Coefficient of Variation 0.4692
  • price is not normally distributed (p-value 4.389640203388992e-05)
  • price has 609 outliers

priced_at

datetime

Approximate Distinct Count 226.3906
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Memory Size 296960
Minimum 2022-02-02 00:00:00
Maximum 2023-04-30 00:00:00
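To put the distinct count in context, it helps to compare it with the number of calendar days between the minimum and maximum. The sketch below assumes priced_at holds whole dates (both extremes are at midnight), and it treats the distinct count as an estimate, since it is not an integer.

```python
from datetime import date

first = date(2022, 2, 2)    # reported Minimum
last = date(2023, 4, 30)    # reported Maximum

calendar_days = (last - first).days + 1   # inclusive of both end dates
approx_distinct = 226.3906                # distinct count as reported

coverage = approx_distinct / calendar_days
print(calendar_days, f"{coverage:.1%}")
```

Under these assumptions, roughly half of the days in the covered period have at least one priced row.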

mileage_category

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 167533

Length

Mean 7.3572
Standard Deviation 1.7274
Median 8
Minimum 5
Maximum 9

Sample

1st row 200k+
2nd row 200k+
3rd row 0-50k
4th row 100k-150k
5th row 0-50k

Letter

Count 30872
Lowercase Letter 30872
Space Separator 0
Uppercase Letter 0
Dash Punctuation 17581
Decimal Number 87117
  • The top 2 categories (50k-100k, 0-50k) account for over 50.0% of rows
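The labels of mileage_category look like 50,000 km bins over the kilometers column. The report does not say how the column was built, so the following is a hypothetical sketch only: the function name, the bin edges, the edge handling, and the fifth label "150k-200k" (which does not appear in the sample rows) are all assumptions.

```python
from bisect import bisect_right

# Hypothetical bin edges and labels; "150k-200k" is inferred, not shown in the report.
EDGES = [50_000, 100_000, 150_000, 200_000]
LABELS = ["0-50k", "50k-100k", "100k-150k", "150k-200k", "200k+"]

def mileage_bucket(km):
    """Map a kilometers reading to a label; a reading equal to an edge goes to the higher bin."""
    return LABELS[bisect_right(EDGES, km)]

print(mileage_bucket(0), mileage_bucket(90_000), mileage_bucket(285_000))
```

These five labels have lengths between 5 and 9 characters, which agrees with the Length statistics reported for the column.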

extra_features_count

numerical

Approximate Distinct Count 39
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 296960
Mean 12.4532
Minimum 1
Maximum 39
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • extra_features_count is skewed right (γ1 = 0.7882)

Quantile Statistics

Minimum 1
5-th Percentile 4
Q1 6
Median 9
Q3 18
95-th Percentile 26
Maximum 39
Range 38
IQR 12

Descriptive Statistics

Mean 12.4532
Standard Deviation 7.7952
Variance 60.7652
Sum 231132
Skewness 0.7882
Kurtosis -0.5406
Coefficient of Variation 0.626
  • extra_features_count is not normally distributed (p-value 6.0474176091435796e-21)
  • extra_features_count has 34 outliers
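The report gives outlier counts for model_year, price, and extra_features_count without naming the rule used. A common convention is Tukey's 1.5 × IQR fences. The sketch below assumes that rule, so it may not match how the report actually counted outliers. It computes the fences from the reported quartiles.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences; values outside them count as outliers."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# (Q1, Q3) pairs copied from the Quantile Statistics tables above.
reported_quartiles = {
    "model_year": (2014, 2020),
    "price": (181000, 338000),
    "extra_features_count": (6, 18),
}

fences = {name: tukey_fences(q1, q3) for name, (q1, q3) in reported_quartiles.items()}
for name, (low, high) in fences.items():
    print(f"{name}: below {low} or above {high}")
```

With these fences, model_year can only have outliers on the low side (its maximum of 2023 is below the upper fence), while price and extra_features_count can only have them on the high side.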

Interactions

Correlations

Missing Values